A novel database of disulfide patterns and its application to the discovery of distantly related homologs.

نویسندگان

  • Herman W T van Vlijmen
  • Abhas Gupta
  • Lakshmi S Narasimhan
  • Juswinder Singh
چکیده

Disulfide bonds are conserved strongly among proteins of related structure and function. Despite the explosive growth of protein sequence databases and the vast numbers of sequence search tools, no tool exists to draw relations between the disulfide patterns of homologous proteins. We present a comprehensive database of disulfide bonding patterns and a search method to find proteins with similar disulfide patterns. The disulfide database was constructed using disulfide annotations extracted from SwissProt, and was expanded significantly from 16,736 to 94,499 disulfide-containing domains by an inference method that combines SwissProt annotations with Pfam multiple alignments. To search the database, we define a disulfide description, called the disulfide signature, which encodes both spacings between cysteine residues and cysteine connectivity. A web tool was developed that allows users to search for related disulfide patterns and for subpatterns resulting from the removal of one or more disulfides from the pattern. We explore the possibility of using disulfide pattern conservation to identify protein homologs that are undetectable by PSI-BLAST. Examples include the homology between a sea anemone antihypertensive/antiviral protein and a sea anemone neurotoxin, and the homology between tick anticoagulant peptide and bovine trypsin inhibitor. In both examples, there is a clear structural similarity and a functional relationship. We used the database to find structural homologs for the Cripto CFC domain. The identification of a von Willebrand Factor C (VWFC)-like domain agrees with its functional role and explains mutation data. We believe that the rapid increase in structure determinations arising from structural genomics efforts and advances in mass spectrometry techniques will greatly increase the number of disulfide annotations. This information will become a valuable resource for structural and functional annotations of proteins. The availability of a searchable disulfide pattern database will thus provide a powerful new addition to existing homolog discovery methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P-215: Discovery of A Novel APA Variant of A Human Potential Gene Based on Expressed Sequenced Tags Analysis

Background: Expressed sequence tags (ESTs) are sequences of cDNA fragments prepared from different tissue sources. There are over one million of these sequences in the publicly available database, and these sequences are believed to represent more than half of all human genes. The ESTs belong to different cDNA libraries, was prepared from one particular cell type, organ, or tumor. Therefore, th...

متن کامل

A Proposed Data Mining Methodology and its Application to Industrial Procedures

Data mining is the process of discovering correlations, patterns, trends or relationships by searching through a large amount of data stored in repositories, corporate databases, and data warehouses. Industrial procedures with the help of engineers, managers, and other specialists, comprise a broad field and have many tools and techniques in their problem-solving arsenal. The purpose of this st...

متن کامل

Discovery of Novel Glucagon Receptor Antagonists Using Combined Pharmacophore Modeling and Docking

Glucagon and the glucagon receptor are most important molecules control over blood glucose concentrations. These two molecules are very important to studies of type 2 diabetic patients. In literature, several classes of small molecule antagonists of the human glucagon receptor have been reported. Glucagon receptor antagonist could decrease hepatic glucose output and improve glucose control in d...

متن کامل

Discovery of Novel Glucagon Receptor Antagonists Using Combined Pharmacophore Modeling and Docking

Glucagon and the glucagon receptor are most important molecules control over blood glucose concentrations. These two molecules are very important to studies of type 2 diabetic patients. In literature, several classes of small molecule antagonists of the human glucagon receptor have been reported. Glucagon receptor antagonist could decrease hepatic glucose output and improve glucose control in d...

متن کامل

Review of NKG2D function and its related ligands: review article

The natural killer group 2D (NKG2D) is a transmembrane protein and a member of the CD94/NKG2 family of C-type lectin-like receptors. NKG2D is encoded by the KLRK1 gene, which is located in the NK-gene complex (NKC) placed on chromosomes 6 and 12 in mice and humans, respectively. NKG2D forms a homodimer structure and binds through ectodomains with its related ligands. Each of its monomers consis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of molecular biology

دوره 335 4  شماره 

صفحات  -

تاریخ انتشار 2004